max rank | avg. rank | sentence |
---|---|---|
117 | 57.6667 | I would like to know if there are patients who need help. |
123 | 70.6667 | I know, it's just because it's new for me. |
151 | 58.6429 | But what information do we need and what would it take to get it? |
151 | 45.4444 | I don't have all the information for that day. |
163 | 62.0769 | What I need to know is A) is it still good to use? |
166 | 63.3333 | The drug does not have the right drug here? |
178 | 56.4444 | I'm sure that most of you have been there! |
180 | 87.5000 | But, you said you feel the symptoms more. |
184 | 91.5000 | If medications are used over the last time. |
208 | 61.8750 | But Do you take any medication for it and if so how do you like it? |
208 | 87.8750 | If you use two doctors with your medication. |
213 | 54.0000 | I take it, but don't know a lot about it. |
219 | 79.2857 | I know what the problems they experience. |
222 | 79.7000 | If you have to find the right meds for us. |
222 | 90.7778 | They were new meds and you should find it. |
223 | 69.5833 | This does not have to be in the Turks and Caicos Islands. |
223 | 94.0000 | What has this got to do with the Turks & Caicos Islands? |
223 | 84.0000 | What has this to do with the Turks & Caicos Islands? |
225 | 70.7500 | I find that the time I just tell them what I read. |
229 | 72.4444 | If you read this and they know you can't. |
230 | 108.3846 | Do you think you feel much better, then you must keep taking it. |
235 | 97.7500 | In case you experience any of those people. |
236 | 104.1667 | But then could they tell me -- its not that still didn't work. |
236 | 82.8182 | I have been through so much but I didn't know why. |
242 | 71.0769 | I am not sure which one but if you really want to know? |
245 | 102.1111 | I don't want to put our patients at risk. |
246 | 102.1429 | I made it through, made it work - but it was a very bad experience. |
246 | 77.8000 | They also have very bad for the way you do. |
251 | 127.2727 | Some work better for some people, others work better for others. |
252 | 90.0000 | The possible side effects, as you have given me. |
The maximum word rank of a sentence is, by definition, the rank of the rarest word in the sentence. If it is low, all words in the sentence are high-frequency words. For this reason, the table of sentences with the lowest maximum word rank may be of interest. The table lists such sentences, restricted to those longer than 40 characters.
The overall distribution of the maximum rank over all sentences of the corpus is shown in a diagram with a log-scaled x-axis.
The sentences in the table described above are of interest because they are usually easy to understand. The distribution may give insights into the corpus and may give parameters for language comparison.
While the distribution can be estimated from a small corpus, sentences like those in the table are rare, so a larger corpus will yield more striking examples.
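The two measures in the table, maximum and average word rank, can be illustrated with a minimal Python sketch. It assumes a toy corpus, whitespace tokenization, and ranks derived from raw counts (rank 1 is the most frequent word). The function names `rank_words` and `rank_stats` are hypothetical and not part of the corpus database.

```python
from collections import Counter

def rank_words(sentences):
    """Map each word to its frequency rank (1 = most frequent)."""
    counts = Counter(w for s in sentences for w in s.lower().split())
    # Sort by descending count; ties are broken alphabetically for determinism.
    ordered = sorted(counts, key=lambda w: (-counts[w], w))
    return {w: r for r, w in enumerate(ordered, start=1)}

def rank_stats(sentence, ranks):
    """Return (maximum rank, average rank) over the words of a sentence."""
    rs = [ranks[w] for w in sentence.lower().split()]
    return max(rs), sum(rs) / len(rs)

# Toy corpus (an assumption for illustration only).
corpus = ["the cat sat", "the dog sat", "the cat ran off"]
ranks = rank_words(corpus)

# Sentences ordered by maximum rank, as in the table.
for s in sorted(corpus, key=lambda s: rank_stats(s, ranks)[0]):
    print(s, rank_stats(s, ranks))
```

A sentence made up only of frequent words gets a low maximum rank, while a single rare word is enough to push the maximum up even if the average stays moderate.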
Table data:
select max(w_id)-100 as m, avg(w_id)-100 as a, s.sentence from sentences s, inv_w i where s.s_id=i.s_id and length(sentence)>40 and i.w_id>100 group by s.s_id order by m limit 30;
Distribution data:
select m, count(*) from (select 100* round((max(w_id)-100)/100) as m from sentences s, inv_w i where s.s_id=i.s_id and i.w_id>100 group by s.s_id) aa group by m;
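The binning performed by the distribution query can be sketched in Python as follows. This is only an illustration: it assumes that halves round up, which may differ from the rounding behaviour of the database in use, and the helper names `bin_to_hundreds` and `max_rank_histogram` are hypothetical. The example values are taken from the "max rank" column of the table.

```python
import math
from collections import Counter

def bin_to_hundreds(max_rank):
    """Round a maximum rank to the nearest multiple of 100 (halves round up)."""
    return 100 * math.floor(max_rank / 100 + 0.5)

def max_rank_histogram(max_ranks):
    """Count sentences per 100-wide bin of maximum word rank."""
    return dict(sorted(Counter(bin_to_hundreds(m) for m in max_ranks).items()))

# A few maximum ranks from the table above.
example = [117, 123, 151, 151, 163, 252]
histogram = max_rank_histogram(example)
print(histogram)
```

Each key of the resulting dictionary corresponds to a value of `m` in the query, and each value to the matching `count(*)`.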
Explain the distribution, especially the increase in its right part.
4.5.2.2 Average word rank in sentence
4.5.2.3 Sentences consisting of many low frequency words I
4.5.2.4 Sentences consisting of many low frequency words II
4.5.2.5 Sentences consisting of short words only I
4.5.2.6 Sentences consisting of short words only II
4.5.2.7 Sentences consisting of long words only I
4.5.2.8 Sentences consisting of long words only II